AITopics | coreference model

Collaborating Authors

coreference model

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Mention Annotations Alone Enable Efficient Domain Adaptation for Coreference Resolution

Gandhi, Nupoor, Field, Anjalie, Strubell, Emma

arXiv.org Artificial IntelligenceMay-30-2023

Although recent neural models for coreference resolution have led to substantial improvements on benchmark datasets, transferring these models to new target domains containing out-of-vocabulary spans and requiring differing annotation schemes remains challenging. Typical approaches involve continued training on annotated target-domain data, but obtaining annotations is costly and time-consuming. We show that annotating mentions alone is nearly twice as fast as annotating full coreference chains. Accordingly, we propose a method for efficiently adapting coreference models, which includes a high-precision mention detection objective and requires annotating only mentions in the target domain. Extensive evaluation across three English coreference datasets: CoNLL-2012 (news/conversation), i2b2/VA (medical notes), and previously unstudied child welfare notes, reveals that our approach facilitates annotation-efficient transfer and results in a 7-14% improvement in average F1 without increasing annotator time.

annotation, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2210.07602

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Dominican Republic (0.04)
Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
(13 more...)

Genre:

Research Report > New Finding (0.93)
Research Report > Experimental Study (0.68)

Industry:

Health & Medicine > Health Care Technology > Medical Record (0.46)
Education > Social Development & Welfare (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Parallel Data Helps Neural Entity Coreference Resolution

Tang, Gongbo, Hardmeier, Christian

arXiv.org Artificial IntelligenceMay-28-2023

Coreference resolution is the task of finding expressions that refer to the same entity in a text. Coreference models are generally trained on monolingual annotated data but annotating coreference is expensive and challenging. Hardmeier et al.(2013) have shown that parallel data contains latent anaphoric knowledge, but it has not been explored in end-to-end neural models yet. In this paper, we propose a simple yet effective model to exploit coreference knowledge from parallel data. In addition to the conventional modules learning coreference from annotations, we introduce an unsupervised module to capture cross-lingual coreference knowledge. Our proposed cross-lingual model achieves consistent improvements, up to 1.74 percentage points, on the OntoNotes 5.0 English dataset using 9 different synthetic parallel datasets. These experimental results confirm that parallel data can provide additional coreference knowledge which is beneficial to coreference resolution tasks.

computational linguistic, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2305.17709

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > China > Beijing > Beijing (0.05)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(15 more...)

Genre: Research Report > New Finding (0.88)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.47)

Add feedback

F-coref: Fast, Accurate and Easy to Use Coreference Resolution

Otmazgin, Shon, Cattan, Arie, Goldberg, Yoav

arXiv.org Artificial IntelligenceOct-25-2022

We introduce fastcoref, a python package for fast, accurate, and easy-to-use English coreference resolution. The package is pip-installable, and allows two modes: an accurate mode based on the LingMess architecture, providing state-of-the-art coreference accuracy, and a substantially faster model, F-coref, which is the focus of this work. F-coref allows to process 2.8K OntoNotes documents in 25 seconds on a V100 GPU (compared to 6 minutes for the LingMess model, and to 12 minutes of the popular AllenNLP coreference model) with only a modest drop in accuracy. The fast speed is achieved through a combination of distillation of a compact model from the LingMess model, and an efficient batching implementation using a technique we call leftover batching. Our code is available at https://github.com/shon-otmazgin/fastcoref

computational linguistic, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2209.0428

Country:

Europe > Belgium > Brussels-Capital Region > Brussels (0.04)
Asia > China > Hong Kong (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
(7 more...)

Genre: Research Report (0.64)

Industry: Education (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Machine Learning for Entity Coreference Resolution: A Retrospective Look at Two Decades of Research

Ng, Vincent (University of Texas at Dallas)

AAAI ConferencesFeb-14-2017

In general, which entity mentions in a text or dialogue refer to the same however, the difficulty of coreference resolution stems from real-world entity. Despite being actively investigated for 50 its reliance on sophisticated knowledge sources and inference years in the natural language processing (NLP) community, mechanisms (Mitkov et al. 2001). Despite its difficulty, it is still far from being solved. To better understand the difficulty coreference resolution is a core task in information extraction: of the task, consider the following sentence: it is the fundamental technology for consolidating the textual information about an entity, which is crucial for essentially The Queen Mother asked Queen Elizabeth II to transform all high-level NLP applications, such as question her sister, Princess Margaret, into a viable answering, text summarization, and machine translation.

artificial intelligence, machine learning, natural language, (19 more...)

AAAI Conferences

Thirty-First AAAI Conference on Artificial Intelligence

Country: North America > United States (0.68)

Industry: Government > Regional Government (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

Ensemble-Based Coreference Resolution

Rahman, Altaf (University of Texas at Dallas) | Ng, Vincent (University of Texas at Dallas)

AAAI ConferencesJul-19-2011

Employing different We investigate new methods for creating and applying coreference models to create ensembles bears resemblance ensembles for coreference resolution. While to Pang and Fan's [2009] approach, where an ensemble of existing ensembles for coreference resolution are pairwise models is applied to Chinese coreference resolution, typically created using different learning algorithms, but contrasts with the vast majority of existing approaches, clustering algorithms or training sets, we where an ensemble of coreference systems is typically created harness recent advances in coreference modeling by employing different learning algorithms [Munson et and propose to create our ensemble from a variety al., 2005] or clustering algorithms [Ng, 2005], or perturbing of supervised coreference models. However, the training set using meta-learning techniques such as the presence of pairwise and non-pairwise coreference bagging and boosting [Ng and Cardie, 2003; Kouchnir, 2004; models in our ensemble presents a challenge Vemulapalli et al., 2009].

antecedent, coreference resolution, ensemble, (16 more...)

AAAI Conferences

Twenty-Second International Joint Conference on Artificial Intelligence

Country: North America > United States > Texas > Dallas County > Richardson (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.46)

Add feedback

Narrowing the Modeling Gap: A Cluster-Ranking Approach to Coreference Resolution

Rahman, A., Ng, V.

Journal of Artificial Intelligence ResearchFeb-25-2011

Traditional learning-based coreference resolvers operate by training the mention-pair model for determining whether two mentions are coreferent or not. Though conceptually simple and easy to understand, the mention-pair model is linguistically rather unappealing and lags far behind the heuristic-based coreference models proposed in the pre-statistical NLP era in terms of sophistication. Two independent lines of recent research have attempted to improve the mention-pair model, one by acquiring the mention-ranking model to rank preceding mentions for a given anaphor, and the other by training the entity-mention model to determine whether a preceding cluster is coreferent with a given mention. We propose a cluster-ranking approach to coreference resolution, which combines the strengths of the mention-ranking model and the entity-mention model, and is therefore theoretically more appealing than both of these models. In addition, we seek to improve cluster rankers via two extensions: (1) lexicalization and (2) incorporating knowledge of anaphoricity by jointly modeling anaphoricity determination and coreference resolution. Experimental results on the ACE data sets demonstrate the superior performance of cluster rankers to competing approaches as well as the effectiveness of our two extensions.

coreference model, coreference resolution, mention-pair model, (16 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.3120

AI Access Foundation

10694

Journal of Artificial Intelligence Research

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(3 more...)

Genre: Research Report > New Finding (0.92)

Industry:

Education (0.47)
Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
(4 more...)

Add feedback

Distantly Labeling Data for Large Scale Cross-Document Coreference

Singh, Sameer, Wick, Michael, McCallum, Andrew

arXiv.org Artificial IntelligenceMay-24-2010

Cross-document coreference, the problem of resolving entity mentions across multi-document collections, is crucial to automated knowledge base construction and data mining tasks. However, the scarcity of large labeled data sets has hindered supervised machine learning research for this task. In this paper we develop and demonstrate an approach based on ``distantly-labeling'' a data set from which we can train a discriminative cross-document coreference model. In particular we build a dataset of more than a million people mentions extracted from 3.5 years of New York Times articles, leverage Wikipedia for distant labeling with a generative model (and measure the reliability of such labeling); then we train and evaluate a conditional random field coreference model that has factors on cross-document entities as well as mention-pairs. This coreference model obtains high accuracy in resolving mentions and entities that are not present in the training data, indicating applicability to non-Wikipedia data. Given the large amount of data, our work is also an exercise demonstrating the scalability of our approach.

coreference, data mining, machine learning, (20 more...)

arXiv.org Artificial Intelligence

1005.4298

Country: North America > United States > Massachusetts (0.46)

Genre: Research Report (0.40)

Industry:

Media > Film (0.46)
Leisure & Entertainment > Sports (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.68)
(2 more...)

Add feedback